Overview
Brought to you by YData
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 1309 |
| Missing cells | 3721 |
| Missing cells (%) | 13.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 214.9 KiB |
| Average record size in memory | 168.1 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 7 |
| Text | 7 |
Age is highly overall correlated with Age_wiki | High correlation |
Age_wiki is highly overall correlated with Age | High correlation |
Boarded is highly overall correlated with Embarked | High correlation |
Class is highly overall correlated with Lifeboat and 2 other fields | High correlation |
Embarked is highly overall correlated with Boarded | High correlation |
Fare is highly overall correlated with WikiId | High correlation |
Lifeboat is highly overall correlated with Class and 1 other fields | High correlation |
Pclass is highly overall correlated with Class and 2 other fields | High correlation |
Sex is highly overall correlated with Survived | High correlation |
Survived is highly overall correlated with Sex | High correlation |
WikiId is highly overall correlated with Class and 2 other fields | High correlation |
Survived has 418 (31.9%) missing values | Missing |
Age has 263 (20.1%) missing values | Missing |
Cabin has 1014 (77.5%) missing values | Missing |
Lifeboat has 807 (61.7%) missing values | Missing |
Body has 1179 (90.1%) missing values | Missing |
PassengerId is uniformly distributed | Uniform |
WikiId is uniformly distributed | Uniform |
PassengerId has unique values | Unique |
SibSp has 891 (68.1%) zeros | Zeros |
Parch has 1002 (76.5%) zeros | Zeros |
Fare has 17 (1.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-12-31 07:14:03.805659 |
|---|---|
| Analysis finished | 2024-12-31 07:14:19.772007 |
| Duration | 15.97 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
PassengerId
Real number (ℝ)
Uniform  Unique 
| Distinct | 1309 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 655 |
| Minimum | 1 |
|---|---|
| Maximum | 1309 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 66.4 |
| Q1 | 328 |
| median | 655 |
| Q3 | 982 |
| 95-th percentile | 1243.6 |
| Maximum | 1309 |
| Range | 1308 |
| Interquartile range (IQR) | 654 |
Descriptive statistics
| Standard deviation | 378.02006 |
|---|---|
| Coefficient of variation (CV) | 0.57712986 |
| Kurtosis | -1.2 |
| Mean | 655 |
| Median Absolute Deviation (MAD) | 327 |
| Skewness | 0 |
| Sum | 857395 |
| Variance | 142899.17 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 861 | 1 | 0.1% |
| 879 | 1 | 0.1% |
| 878 | 1 | 0.1% |
| 877 | 1 | 0.1% |
| 876 | 1 | 0.1% |
| 875 | 1 | 0.1% |
| 874 | 1 | 0.1% |
| 873 | 1 | 0.1% |
| 872 | 1 | 0.1% |
| Other values (1299) | 1299 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 1309 | 1 | |
| 1308 | 1 | |
| 1307 | 1 | |
| 1306 | 1 | |
| 1305 | 1 | |
| 1304 | 1 | |
| 1303 | 1 | |
| 1302 | 1 | |
| 1301 | 1 | |
| 1300 | 1 |
Survived
Categorical
High correlation  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 418 |
| Missing (%) | 31.9% |
| Memory size | 10.4 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 549 | |
| 1.0 | 342 | |
| (Missing) | 418 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 549 | |
| 1.0 | 342 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1440 | |
| . | 891 | |
| 1 | 342 | 12.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2673 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1440 | |
| . | 891 | |
| 1 | 342 | 12.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2673 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1440 | |
| . | 891 | |
| 1 | 342 | 12.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2673 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1440 | |
| . | 891 | |
| 1 | 342 | 12.8% |
Pclass
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.4 KiB |
| 3 | |
|---|---|
| 1 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 1 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 709 | |
| 1 | 323 | |
| 2 | 277 | 21.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 709 | |
| 1 | 323 | |
| 2 | 277 | 21.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 709 | |
| 1 | 323 | |
| 2 | 277 | 21.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1309 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 709 | |
| 1 | 323 | |
| 2 | 277 | 21.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1309 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 709 | |
| 1 | 323 | |
| 2 | 277 | 21.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1309 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 709 | |
| 1 | 323 | |
| 2 | 277 | 21.2% |
Name
Text
| Distinct | 1307 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.4 KiB |
Length
| Max length | 82 |
|---|---|
| Median length | 56 |
| Mean length | 27.130634 |
| Min length | 12 |
Unique
| Unique | 1305 ? |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | Braund, Mr. Owen Harris |
|---|---|
| 2nd row | Cumings, Mrs. John Bradley (Florence Briggs Thayer) |
| 3rd row | Heikkinen, Miss. Laina |
| 4th row | Futrelle, Mrs. Jacques Heath (Lily May Peel) |
| 5th row | Allen, Mr. William Henry |
| Value | Count | Frequency (%) |
| mr | 763 | 14.3% |
| miss | 260 | 4.9% |
| mrs | 201 | 3.8% |
| william | 87 | 1.6% |
| john | 72 | 1.3% |
| master | 61 | 1.1% |
| henry | 49 | 0.9% |
| charles | 39 | 0.7% |
| james | 38 | 0.7% |
| george | 37 | 0.7% |
| Other values (1940) | 3742 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4044 | 11.4% | |
| r | 2929 | 8.2% |
| e | 2525 | 7.1% |
| a | 2443 | 6.9% |
| i | 1946 | 5.5% |
| s | 1925 | 5.4% |
| n | 1900 | 5.4% |
| M | 1643 | 4.6% |
| l | 1593 | 4.5% |
| o | 1475 | 4.2% |
| Other values (50) | 13091 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35514 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4044 | 11.4% | |
| r | 2929 | 8.2% |
| e | 2525 | 7.1% |
| a | 2443 | 6.9% |
| i | 1946 | 5.5% |
| s | 1925 | 5.4% |
| n | 1900 | 5.4% |
| M | 1643 | 4.6% |
| l | 1593 | 4.5% |
| o | 1475 | 4.2% |
| Other values (50) | 13091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35514 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4044 | 11.4% | |
| r | 2929 | 8.2% |
| e | 2525 | 7.1% |
| a | 2443 | 6.9% |
| i | 1946 | 5.5% |
| s | 1925 | 5.4% |
| n | 1900 | 5.4% |
| M | 1643 | 4.6% |
| l | 1593 | 4.5% |
| o | 1475 | 4.2% |
| Other values (50) | 13091 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35514 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4044 | 11.4% | |
| r | 2929 | 8.2% |
| e | 2525 | 7.1% |
| a | 2443 | 6.9% |
| i | 1946 | 5.5% |
| s | 1925 | 5.4% |
| n | 1900 | 5.4% |
| M | 1643 | 4.6% |
| l | 1593 | 4.5% |
| o | 1475 | 4.2% |
| Other values (50) | 13091 |
Sex
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.4 KiB |
| male | |
|---|---|
| female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.7119939 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
| male | 843 | |
| female | 466 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 843 | |
| female | 466 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1775 | |
| m | 1309 | |
| a | 1309 | |
| l | 1309 | |
| f | 466 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6168 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1775 | |
| m | 1309 | |
| a | 1309 | |
| l | 1309 | |
| f | 466 | 7.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6168 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1775 | |
| m | 1309 | |
| a | 1309 | |
| l | 1309 | |
| f | 466 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6168 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1775 | |
| m | 1309 | |
| a | 1309 | |
| l | 1309 | |
| f | 466 | 7.6% |
Age
Real number (ℝ)
High correlation  Missing 
| Distinct | 98 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 263 |
| Missing (%) | 20.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.881138 |
| Minimum | 0.17 |
|---|---|
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.4 KiB |
Quantile statistics
| Minimum | 0.17 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 21 |
| median | 28 |
| Q3 | 39 |
| 95-th percentile | 57 |
| Maximum | 80 |
| Range | 79.83 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 14.413493 |
|---|---|
| Coefficient of variation (CV) | 0.48236093 |
| Kurtosis | 0.14694764 |
| Mean | 29.881138 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.40767456 |
| Sum | 31255.67 |
| Variance | 207.74879 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 47 | 3.6% |
| 22 | 43 | 3.3% |
| 21 | 41 | 3.1% |
| 30 | 40 | 3.1% |
| 18 | 39 | 3.0% |
| 25 | 34 | 2.6% |
| 28 | 32 | 2.4% |
| 36 | 31 | 2.4% |
| 26 | 30 | 2.3% |
| 27 | 30 | 2.3% |
| Other values (88) | 679 | |
| (Missing) | 263 | 20.1% |
| Value | Count | Frequency (%) |
| 0.17 | 1 | 0.1% |
| 0.33 | 1 | 0.1% |
| 0.42 | 1 | 0.1% |
| 0.67 | 1 | 0.1% |
| 0.75 | 3 | 0.2% |
| 0.83 | 3 | 0.2% |
| 0.92 | 2 | 0.2% |
| 1 | 10 | |
| 2 | 12 | |
| 3 | 7 |
| Value | Count | Frequency (%) |
| 80 | 1 | 0.1% |
| 76 | 1 | 0.1% |
| 74 | 1 | 0.1% |
| 71 | 2 | 0.2% |
| 70.5 | 1 | 0.1% |
| 70 | 2 | 0.2% |
| 67 | 1 | 0.1% |
| 66 | 1 | 0.1% |
| 65 | 3 | |
| 64 | 5 |
SibSp
Real number (ℝ)
Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.49885409 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 891 |
| Zeros (%) | 68.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.0416584 |
|---|---|
| Coefficient of variation (CV) | 2.0881023 |
| Kurtosis | 20.043251 |
| Mean | 0.49885409 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.8442203 |
| Sum | 653 |
| Variance | 1.0850522 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 891 | |
| 1 | 319 | 24.4% |
| 2 | 42 | 3.2% |
| 4 | 22 | 1.7% |
| 3 | 20 | 1.5% |
| 8 | 9 | 0.7% |
| 5 | 6 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 891 | |
| 1 | 319 | 24.4% |
| 2 | 42 | 3.2% |
| 3 | 20 | 1.5% |
| 4 | 22 | 1.7% |
| 5 | 6 | 0.5% |
| 8 | 9 | 0.7% |
| Value | Count | Frequency (%) |
| 8 | 9 | 0.7% |
| 5 | 6 | 0.5% |
| 4 | 22 | 1.7% |
| 3 | 20 | 1.5% |
| 2 | 42 | 3.2% |
| 1 | 319 | 24.4% |
| 0 | 891 |
Parch
Real number (ℝ)
Zeros 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.38502674 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 1002 |
| Zeros (%) | 76.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.86556028 |
|---|---|
| Coefficient of variation (CV) | 2.2480524 |
| Kurtosis | 21.541079 |
| Mean | 0.38502674 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.6690782 |
| Sum | 504 |
| Variance | 0.74919459 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1002 | |
| 1 | 170 | 13.0% |
| 2 | 113 | 8.6% |
| 3 | 8 | 0.6% |
| 5 | 6 | 0.5% |
| 4 | 6 | 0.5% |
| 6 | 2 | 0.2% |
| 9 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 1002 | |
| 1 | 170 | 13.0% |
| 2 | 113 | 8.6% |
| 3 | 8 | 0.6% |
| 4 | 6 | 0.5% |
| 5 | 6 | 0.5% |
| 6 | 2 | 0.2% |
| 9 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 9 | 2 | 0.2% |
| 6 | 2 | 0.2% |
| 5 | 6 | 0.5% |
| 4 | 6 | 0.5% |
| 3 | 8 | 0.6% |
| 2 | 113 | 8.6% |
| 1 | 170 | 13.0% |
| 0 | 1002 |
Ticket
Text
| Distinct | 929 |
|---|---|
| Distinct (%) | 71.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.4 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 6.7906799 |
| Min length | 3 |
Unique
| Unique | 713 ? |
|---|---|
| Unique (%) | 54.5% |
Sample
| 1st row | A/5 21171 |
|---|---|
| 2nd row | PC 17599 |
| 3rd row | STON/O2. 3101282 |
| 4th row | 113803 |
| 5th row | 373450 |
| Value | Count | Frequency (%) |
| pc | 92 | 5.5% |
| c.a | 46 | 2.7% |
| ca | 22 | 1.3% |
| a/5 | 22 | 1.3% |
| 2 | 17 | 1.0% |
| soton/o.q | 16 | 1.0% |
| sc/paris | 16 | 1.0% |
| ston/o | 14 | 0.8% |
| w./c | 14 | 0.8% |
| 2343 | 11 | 0.7% |
| Other values (960) | 1403 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1110 | |
| 1 | 1000 | |
| 2 | 862 | |
| 7 | 697 | 7.8% |
| 4 | 652 | 7.3% |
| 6 | 628 | 7.1% |
| 0 | 610 | 6.9% |
| 5 | 582 | 6.5% |
| 9 | 465 | 5.2% |
| 8 | 426 | 4.8% |
| Other values (25) | 1857 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8889 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 1110 | |
| 1 | 1000 | |
| 2 | 862 | |
| 7 | 697 | 7.8% |
| 4 | 652 | 7.3% |
| 6 | 628 | 7.1% |
| 0 | 610 | 6.9% |
| 5 | 582 | 6.5% |
| 9 | 465 | 5.2% |
| 8 | 426 | 4.8% |
| Other values (25) | 1857 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8889 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 1110 | |
| 1 | 1000 | |
| 2 | 862 | |
| 7 | 697 | 7.8% |
| 4 | 652 | 7.3% |
| 6 | 628 | 7.1% |
| 0 | 610 | 6.9% |
| 5 | 582 | 6.5% |
| 9 | 465 | 5.2% |
| 8 | 426 | 4.8% |
| Other values (25) | 1857 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8889 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 1110 | |
| 1 | 1000 | |
| 2 | 862 | |
| 7 | 697 | 7.8% |
| 4 | 652 | 7.3% |
| 6 | 628 | 7.1% |
| 0 | 610 | 6.9% |
| 5 | 582 | 6.5% |
| 9 | 465 | 5.2% |
| 8 | 426 | 4.8% |
| Other values (25) | 1857 |
Fare
Real number (ℝ)
High correlation  Zeros 
| Distinct | 281 |
|---|---|
| Distinct (%) | 21.5% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.295479 |
| Minimum | 0 |
|---|---|
| Maximum | 512.3292 |
| Zeros | 17 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.225 |
| Q1 | 7.8958 |
| median | 14.4542 |
| Q3 | 31.275 |
| 95-th percentile | 133.65 |
| Maximum | 512.3292 |
| Range | 512.3292 |
| Interquartile range (IQR) | 23.3792 |
Descriptive statistics
| Standard deviation | 51.758668 |
|---|---|
| Coefficient of variation (CV) | 1.5545254 |
| Kurtosis | 27.027986 |
| Mean | 33.295479 |
| Median Absolute Deviation (MAD) | 6.9042 |
| Skewness | 4.3677091 |
| Sum | 43550.487 |
| Variance | 2678.9597 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.05 | 60 | 4.6% |
| 13 | 59 | 4.5% |
| 7.75 | 55 | 4.2% |
| 26 | 50 | 3.8% |
| 7.8958 | 49 | 3.7% |
| 10.5 | 35 | 2.7% |
| 7.775 | 26 | 2.0% |
| 7.2292 | 24 | 1.8% |
| 7.925 | 23 | 1.8% |
| 26.55 | 22 | 1.7% |
| Other values (271) | 905 |
| Value | Count | Frequency (%) |
| 0 | 17 | |
| 3.1708 | 1 | 0.1% |
| 4.0125 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| 6.2375 | 1 | 0.1% |
| 6.4375 | 3 | 0.2% |
| 6.45 | 1 | 0.1% |
| 6.4958 | 3 | 0.2% |
| 6.75 | 2 | 0.2% |
| 6.8583 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 512.3292 | 4 | |
| 263 | 6 | |
| 262.375 | 7 | |
| 247.5208 | 3 | |
| 227.525 | 5 | |
| 221.7792 | 4 | |
| 211.5 | 5 | |
| 211.3375 | 4 | |
| 164.8667 | 4 | |
| 153.4625 | 3 |
Cabin
Text
Missing 
| Distinct | 186 |
|---|---|
| Distinct (%) | 63.1% |
| Missing | 1014 |
| Missing (%) | 77.5% |
| Memory size | 10.4 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 3 |
| Mean length | 3.7389831 |
| Min length | 1 |
Unique
| Unique | 107 ? |
|---|---|
| Unique (%) | 36.3% |
Sample
| 1st row | C85 |
|---|---|
| 2nd row | C123 |
| 3rd row | E46 |
| 4th row | G6 |
| 5th row | C103 |
| Value | Count | Frequency (%) |
| f | 8 | 2.2% |
| c23 | 6 | 1.7% |
| c27 | 6 | 1.7% |
| c25 | 6 | 1.7% |
| b57 | 5 | 1.4% |
| b59 | 5 | 1.4% |
| b63 | 5 | 1.4% |
| b66 | 5 | 1.4% |
| g6 | 5 | 1.4% |
| f4 | 4 | 1.1% |
| Other values (192) | 301 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 114 | 10.3% |
| 2 | 97 | 8.8% |
| B | 96 | 8.7% |
| 1 | 94 | 8.5% |
| 3 | 87 | 7.9% |
| 6 | 81 | 7.3% |
| 5 | 79 | 7.2% |
| 61 | 5.5% | |
| 4 | 58 | 5.3% |
| 8 | 51 | 4.6% |
| Other values (9) | 285 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1103 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 114 | 10.3% |
| 2 | 97 | 8.8% |
| B | 96 | 8.7% |
| 1 | 94 | 8.5% |
| 3 | 87 | 7.9% |
| 6 | 81 | 7.3% |
| 5 | 79 | 7.2% |
| 61 | 5.5% | |
| 4 | 58 | 5.3% |
| 8 | 51 | 4.6% |
| Other values (9) | 285 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1103 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 114 | 10.3% |
| 2 | 97 | 8.8% |
| B | 96 | 8.7% |
| 1 | 94 | 8.5% |
| 3 | 87 | 7.9% |
| 6 | 81 | 7.3% |
| 5 | 79 | 7.2% |
| 61 | 5.5% | |
| 4 | 58 | 5.3% |
| 8 | 51 | 4.6% |
| Other values (9) | 285 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1103 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 114 | 10.3% |
| 2 | 97 | 8.8% |
| B | 96 | 8.7% |
| 1 | 94 | 8.5% |
| 3 | 87 | 7.9% |
| 6 | 81 | 7.3% |
| 5 | 79 | 7.2% |
| 61 | 5.5% | |
| 4 | 58 | 5.3% |
| 8 | 51 | 4.6% |
| Other values (9) | 285 |
Embarked
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2 |
| Missing (%) | 0.2% |
| Memory size | 10.4 KiB |
| S | |
|---|---|
| C | |
| Q |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | C |
| 3rd row | S |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 914 | |
| C | 270 | 20.6% |
| Q | 123 | 9.4% |
| (Missing) | 2 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| s | 914 | |
| c | 270 | 20.7% |
| q | 123 | 9.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 914 | |
| C | 270 | 20.7% |
| Q | 123 | 9.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1307 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 914 | |
| C | 270 | 20.7% |
| Q | 123 | 9.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1307 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 914 | |
| C | 270 | 20.7% |
| Q | 123 | 9.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1307 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 914 | |
| C | 270 | 20.7% |
| Q | 123 | 9.4% |
WikiId
Real number (ℝ)
High correlation  Uniform 
| Distinct | 1304 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 658.53451 |
| Minimum | 1 |
|---|---|
| Maximum | 1314 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 66.15 |
| Q1 | 326.75 |
| median | 661.5 |
| Q3 | 987.25 |
| 95-th percentile | 1248.85 |
| Maximum | 1314 |
| Range | 1313 |
| Interquartile range (IQR) | 660.5 |
Descriptive statistics
| Standard deviation | 380.37737 |
|---|---|
| Coefficient of variation (CV) | 0.57761191 |
| Kurtosis | -1.2052155 |
| Mean | 658.53451 |
| Median Absolute Deviation (MAD) | 330.5 |
| Skewness | -0.0074106982 |
| Sum | 858729 |
| Variance | 144686.95 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 951 | 1 | 0.1% |
| 842 | 1 | 0.1% |
| 1311 | 1 | 0.1% |
| 328 | 1 | 0.1% |
| 1278 | 1 | 0.1% |
| 54 | 1 | 0.1% |
| 27 | 1 | 0.1% |
| 667 | 1 | 0.1% |
| 903 | 1 | 0.1% |
| 1277 | 1 | 0.1% |
| Other values (1294) | 1294 | |
| (Missing) | 5 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 1314 | 1 | |
| 1313 | 1 | |
| 1312 | 1 | |
| 1311 | 1 | |
| 1310 | 1 | |
| 1309 | 1 | |
| 1308 | 1 | |
| 1307 | 1 | |
| 1306 | 1 | |
| 1305 | 1 |
Name_wiki
Text
| Distinct | 1303 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 5 |
| Missing (%) | 0.4% |
| Memory size | 10.4 KiB |
Length
| Max length | 69 |
|---|---|
| Median length | 53 |
| Mean length | 27.32362 |
| Min length | 12 |
Unique
| Unique | 1302 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | Braund, Mr. Owen Harris |
|---|---|
| 2nd row | Cumings, Mrs. Florence Briggs (née Thayer) |
| 3rd row | Heikkinen, Miss Laina |
| 4th row | Futrelle, Mrs. Lily May (née Peel) |
| 5th row | Allen, Mr. William Henry |
| Value | Count | Frequency (%) |
| mr | 756 | 13.9% |
| miss | 267 | 4.9% |
| mrs | 195 | 3.6% |
| née | 179 | 3.3% |
| william | 69 | 1.3% |
| master | 61 | 1.1% |
| john | 61 | 1.1% |
| and | 41 | 0.8% |
| henry | 41 | 0.8% |
| mary | 38 | 0.7% |
| Other values (2001) | 3727 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4131 | 11.6% | |
| r | 2834 | 8.0% |
| e | 2571 | 7.2% |
| a | 2493 | 7.0% |
| n | 2102 | 5.9% |
| i | 1932 | 5.4% |
| s | 1918 | 5.4% |
| M | 1644 | 4.6% |
| l | 1522 | 4.3% |
| o | 1385 | 3.9% |
| Other values (82) | 13098 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35630 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4131 | 11.6% | |
| r | 2834 | 8.0% |
| e | 2571 | 7.2% |
| a | 2493 | 7.0% |
| n | 2102 | 5.9% |
| i | 1932 | 5.4% |
| s | 1918 | 5.4% |
| M | 1644 | 4.6% |
| l | 1522 | 4.3% |
| o | 1385 | 3.9% |
| Other values (82) | 13098 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35630 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4131 | 11.6% | |
| r | 2834 | 8.0% |
| e | 2571 | 7.2% |
| a | 2493 | 7.0% |
| n | 2102 | 5.9% |
| i | 1932 | 5.4% |
| s | 1918 | 5.4% |
| M | 1644 | 4.6% |
| l | 1522 | 4.3% |
| o | 1385 | 3.9% |
| Other values (82) | 13098 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35630 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4131 | 11.6% | |
| r | 2834 | 8.0% |
| e | 2571 | 7.2% |
| a | 2493 | 7.0% |
| n | 2102 | 5.9% |
| i | 1932 | 5.4% |
| s | 1918 | 5.4% |
| M | 1644 | 4.6% |
| l | 1522 | 4.3% |
| o | 1385 | 3.9% |
| Other values (82) | 13098 |
Age_wiki
Real number (ℝ)
High correlation 
| Distinct | 78 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 7 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.415829 |
| Minimum | 0.17 |
|---|---|
| Maximum | 74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.4 KiB |
Quantile statistics
| Minimum | 0.17 |
|---|---|
| 5-th percentile | 6.05 |
| Q1 | 21 |
| median | 28 |
| Q3 | 37.75 |
| 95-th percentile | 55 |
| Maximum | 74 |
| Range | 73.83 |
| Interquartile range (IQR) | 16.75 |
Descriptive statistics
| Standard deviation | 13.758954 |
|---|---|
| Coefficient of variation (CV) | 0.4677398 |
| Kurtosis | 0.17307643 |
| Mean | 29.415829 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.43161917 |
| Sum | 38299.41 |
| Variance | 189.30882 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22 | 62 | 4.7% |
| 24 | 56 | 4.3% |
| 21 | 51 | 3.9% |
| 28 | 47 | 3.6% |
| 18 | 46 | 3.5% |
| 19 | 45 | 3.4% |
| 25 | 45 | 3.4% |
| 20 | 45 | 3.4% |
| 30 | 43 | 3.3% |
| 29 | 42 | 3.2% |
| Other values (68) | 820 |
| Value | Count | Frequency (%) |
| 0.17 | 1 | 0.1% |
| 0.33 | 1 | 0.1% |
| 0.42 | 1 | 0.1% |
| 0.58 | 1 | 0.1% |
| 0.75 | 2 | 0.2% |
| 0.83 | 3 | 0.2% |
| 0.92 | 1 | 0.1% |
| 1 | 11 | |
| 2 | 13 | |
| 3 | 7 |
| Value | Count | Frequency (%) |
| 74 | 1 | 0.1% |
| 71 | 3 | |
| 70 | 1 | 0.1% |
| 69 | 1 | 0.1% |
| 67 | 1 | 0.1% |
| 66 | 2 | 0.2% |
| 65 | 2 | 0.2% |
| 64 | 5 | |
| 63 | 6 | |
| 62 | 6 |
Hometown
Text
| Distinct | 566 |
|---|---|
| Distinct (%) | 43.4% |
| Missing | 5 |
| Missing (%) | 0.4% |
| Memory size | 10.4 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 23.526074 |
| Min length | 6 |
Unique
| Unique | 348 ? |
|---|---|
| Unique (%) | 26.7% |
Sample
| 1st row | Bridgerule, Devon, England |
|---|---|
| 2nd row | New York, New York, US |
| 3rd row | Jyväskylä, Finland |
| 4th row | Scituate, Massachusetts, US |
| 5th row | Birmingham, West Midlands, England |
| Value | Count | Frequency (%) |
| england | 318 | 7.9% |
| us | 291 | 7.2% |
| new | 212 | 5.2% |
| york | 186 | 4.6% |
| ireland | 120 | 3.0% |
| sweden | 106 | 2.6% |
| london | 98 | 2.4% |
| uk | 66 | 1.6% |
| lebanon | 65 | 1.6% |
| finland | 56 | 1.4% |
| Other values (758) | 2525 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 2809 | 9.2% |
| 2739 | 8.9% | |
| a | 2403 | 7.8% |
| , | 2217 | 7.2% |
| e | 2160 | 7.0% |
| o | 1706 | 5.6% |
| r | 1679 | 5.5% |
| l | 1424 | 4.6% |
| i | 1211 | 3.9% |
| d | 1179 | 3.8% |
| Other values (78) | 11151 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 30678 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 2809 | 9.2% |
| 2739 | 8.9% | |
| a | 2403 | 7.8% |
| , | 2217 | 7.2% |
| e | 2160 | 7.0% |
| o | 1706 | 5.6% |
| r | 1679 | 5.5% |
| l | 1424 | 4.6% |
| i | 1211 | 3.9% |
| d | 1179 | 3.8% |
| Other values (78) | 11151 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 30678 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 2809 | 9.2% |
| 2739 | 8.9% | |
| a | 2403 | 7.8% |
| , | 2217 | 7.2% |
| e | 2160 | 7.0% |
| o | 1706 | 5.6% |
| r | 1679 | 5.5% |
| l | 1424 | 4.6% |
| i | 1211 | 3.9% |
| d | 1179 | 3.8% |
| Other values (78) | 11151 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 30678 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 2809 | 9.2% |
| 2739 | 8.9% | |
| a | 2403 | 7.8% |
| , | 2217 | 7.2% |
| e | 2160 | 7.0% |
| o | 1706 | 5.6% |
| r | 1679 | 5.5% |
| l | 1424 | 4.6% |
| i | 1211 | 3.9% |
| d | 1179 | 3.8% |
| Other values (78) | 11151 |
Boarded
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 5 |
| Missing (%) | 0.4% |
| Memory size | 10.4 KiB |
| Southampton | |
|---|---|
| Cherbourg | |
| Queenstown | |
| Belfast | 10 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.480828 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Southampton |
|---|---|
| 2nd row | Cherbourg |
| 3rd row | Southampton |
| 4th row | Southampton |
| 5th row | Southampton |
Common Values
| Value | Count | Frequency (%) |
| Southampton | 916 | |
| Cherbourg | 259 | 19.8% |
| Queenstown | 119 | 9.1% |
| Belfast | 10 | 0.8% |
| (Missing) | 5 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| southampton | 916 | |
| cherbourg | 259 | 19.9% |
| queenstown | 119 | 9.1% |
| belfast | 10 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2210 | |
| t | 1961 | |
| u | 1294 | |
| h | 1175 | |
| n | 1154 | |
| a | 926 | |
| S | 916 | |
| m | 916 | |
| p | 916 | |
| r | 518 | 3.8% |
| Other values (10) | 1681 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13667 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 2210 | |
| t | 1961 | |
| u | 1294 | |
| h | 1175 | |
| n | 1154 | |
| a | 926 | |
| S | 916 | |
| m | 916 | |
| p | 916 | |
| r | 518 | 3.8% |
| Other values (10) | 1681 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13667 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 2210 | |
| t | 1961 | |
| u | 1294 | |
| h | 1175 | |
| n | 1154 | |
| a | 926 | |
| S | 916 | |
| m | 916 | |
| p | 916 | |
| r | 518 | 3.8% |
| Other values (10) | 1681 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13667 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 2210 | |
| t | 1961 | |
| u | 1294 | |
| h | 1175 | |
| n | 1154 | |
| a | 926 | |
| S | 916 | |
| m | 916 | |
| p | 916 | |
| r | 518 | 3.8% |
| Other values (10) | 1681 |
Destination
Text
| Distinct | 291 |
|---|---|
| Distinct (%) | 22.3% |
| Missing | 5 |
| Missing (%) | 0.4% |
| Memory size | 10.4 KiB |
Length
| Max length | 39 |
|---|---|
| Median length | 32 |
| Mean length | 21.516104 |
| Min length | 2 |
Unique
| Unique | 145 ? |
|---|---|
| Unique (%) | 11.1% |
Sample
| 1st row | Qu'Appelle Valley, Saskatchewan, Canada |
|---|---|
| 2nd row | New York, New York, US |
| 3rd row | New York City |
| 4th row | Scituate, Massachusetts, US |
| 5th row | New York City |
| Value | Count | Frequency (%) |
| us | 926 | |
| new | 661 | |
| york | 582 | 12.9% |
| city | 247 | 5.5% |
| canada | 125 | 2.8% |
| illinois | 100 | 2.2% |
| pennsylvania | 99 | 2.2% |
| chicago | 75 | 1.7% |
| michigan | 72 | 1.6% |
| jersey | 60 | 1.3% |
| Other values (347) | 1562 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3205 | 11.4% | |
| , | 2094 | 7.5% |
| a | 1831 | 6.5% |
| o | 1808 | 6.4% |
| e | 1783 | 6.4% |
| n | 1639 | 5.8% |
| i | 1582 | 5.6% |
| r | 1331 | 4.7% |
| t | 1067 | 3.8% |
| S | 1033 | 3.7% |
| Other values (46) | 10684 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 28057 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3205 | 11.4% | |
| , | 2094 | 7.5% |
| a | 1831 | 6.5% |
| o | 1808 | 6.4% |
| e | 1783 | 6.4% |
| n | 1639 | 5.8% |
| i | 1582 | 5.6% |
| r | 1331 | 4.7% |
| t | 1067 | 3.8% |
| S | 1033 | 3.7% |
| Other values (46) | 10684 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 28057 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3205 | 11.4% | |
| , | 2094 | 7.5% |
| a | 1831 | 6.5% |
| o | 1808 | 6.4% |
| e | 1783 | 6.4% |
| n | 1639 | 5.8% |
| i | 1582 | 5.6% |
| r | 1331 | 4.7% |
| t | 1067 | 3.8% |
| S | 1033 | 3.7% |
| Other values (46) | 10684 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 28057 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3205 | 11.4% | |
| , | 2094 | 7.5% |
| a | 1831 | 6.5% |
| o | 1808 | 6.4% |
| e | 1783 | 6.4% |
| n | 1639 | 5.8% |
| i | 1582 | 5.6% |
| r | 1331 | 4.7% |
| t | 1067 | 3.8% |
| S | 1033 | 3.7% |
| Other values (46) | 10684 |
Lifeboat
Categorical
High correlation  Missing 
| Distinct | 24 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 807 |
| Missing (%) | 61.7% |
| Memory size | 10.4 KiB |
| 13 | |
|---|---|
| C | |
| 15 | |
| 14 | |
| 4 | 31 |
| Other values (19) |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.4342629 |
| Min length | 1 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 14? |
| 3rd row | D |
| 4th row | 15 |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| 13 | 42 | 3.2% |
| C | 41 | 3.1% |
| 15 | 38 | 2.9% |
| 14 | 34 | 2.6% |
| 4 | 31 | 2.4% |
| 5 | 29 | 2.2% |
| 10 | 29 | 2.2% |
| 9 | 26 | 2.0% |
| 11 | 26 | 2.0% |
| 3 | 26 | 2.0% |
| Other values (14) | 180 | 13.8% |
| (Missing) | 807 |
Length
| Value | Count | Frequency (%) |
| 13 | 42 | 8.4% |
| c | 41 | 8.2% |
| 15 | 39 | 7.8% |
| 14 | 35 | 7.0% |
| 4 | 31 | 6.2% |
| 5 | 29 | 5.8% |
| 10 | 29 | 5.8% |
| 9 | 26 | 5.2% |
| 11 | 26 | 5.2% |
| 3 | 26 | 5.2% |
| Other values (12) | 178 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 243 | |
| 3 | 68 | 9.4% |
| 5 | 68 | 9.4% |
| 4 | 67 | 9.3% |
| 6 | 45 | 6.2% |
| C | 41 | 5.7% |
| 2 | 32 | 4.4% |
| 0 | 29 | 4.0% |
| 9 | 26 | 3.6% |
| 8 | 24 | 3.3% |
| Other values (7) | 77 | 10.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 720 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 243 | |
| 3 | 68 | 9.4% |
| 5 | 68 | 9.4% |
| 4 | 67 | 9.3% |
| 6 | 45 | 6.2% |
| C | 41 | 5.7% |
| 2 | 32 | 4.4% |
| 0 | 29 | 4.0% |
| 9 | 26 | 3.6% |
| 8 | 24 | 3.3% |
| Other values (7) | 77 | 10.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 720 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 243 | |
| 3 | 68 | 9.4% |
| 5 | 68 | 9.4% |
| 4 | 67 | 9.3% |
| 6 | 45 | 6.2% |
| C | 41 | 5.7% |
| 2 | 32 | 4.4% |
| 0 | 29 | 4.0% |
| 9 | 26 | 3.6% |
| 8 | 24 | 3.3% |
| Other values (7) | 77 | 10.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 720 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 243 | |
| 3 | 68 | 9.4% |
| 5 | 68 | 9.4% |
| 4 | 67 | 9.3% |
| 6 | 45 | 6.2% |
| C | 41 | 5.7% |
| 2 | 32 | 4.4% |
| 0 | 29 | 4.0% |
| 9 | 26 | 3.6% |
| 8 | 24 | 3.3% |
| Other values (7) | 77 | 10.7% |
Body
Text
Missing 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1179 |
| Missing (%) | 90.1% |
| Memory size | 10.4 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 5 |
| Mean length | 4.7461538 |
| Min length | 3 |
Unique
| Unique | 130 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 175MB |
|---|---|
| 2nd row | 322M |
| 3rd row | 38MB |
| 4th row | 234MB |
| 5th row | 181MB |
| Value | Count | Frequency (%) |
| 96mb | 1 | 0.8% |
| 306mb | 1 | 0.8% |
| 322m | 1 | 0.8% |
| 38mb | 1 | 0.8% |
| 234mb | 1 | 0.8% |
| 181mb | 1 | 0.8% |
| 309m | 1 | 0.8% |
| 140mb | 1 | 0.8% |
| 240{?}mb | 1 | 0.8% |
| 283mb | 1 | 0.8% |
| Other values (120) | 120 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 128 | |
| B | 120 | |
| 1 | 63 | |
| 2 | 58 | |
| 3 | 37 | 6.0% |
| 8 | 30 | 4.9% |
| 6 | 29 | 4.7% |
| 9 | 29 | 4.7% |
| 5 | 28 | 4.5% |
| 7 | 28 | 4.5% |
| Other values (8) | 67 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 617 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 128 | |
| B | 120 | |
| 1 | 63 | |
| 2 | 58 | |
| 3 | 37 | 6.0% |
| 8 | 30 | 4.9% |
| 6 | 29 | 4.7% |
| 9 | 29 | 4.7% |
| 5 | 28 | 4.5% |
| 7 | 28 | 4.5% |
| Other values (8) | 67 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 617 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 128 | |
| B | 120 | |
| 1 | 63 | |
| 2 | 58 | |
| 3 | 37 | 6.0% |
| 8 | 30 | 4.9% |
| 6 | 29 | 4.7% |
| 9 | 29 | 4.7% |
| 5 | 28 | 4.5% |
| 7 | 28 | 4.5% |
| Other values (8) | 67 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 617 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 128 | |
| B | 120 | |
| 1 | 63 | |
| 2 | 58 | |
| 3 | 37 | 6.0% |
| 8 | 30 | 4.9% |
| 6 | 29 | 4.7% |
| 9 | 29 | 4.7% |
| 5 | 28 | 4.5% |
| 7 | 28 | 4.5% |
| Other values (8) | 67 |
Class
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 5 |
| Missing (%) | 0.4% |
| Memory size | 10.4 KiB |
| 3.0 | |
|---|---|
| 1.0 | |
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 3.0 |
| 4th row | 1.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 706 | |
| 1.0 | 326 | |
| 2.0 | 272 | 20.8% |
| (Missing) | 5 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3.0 | 706 | |
| 1.0 | 326 | |
| 2.0 | 272 | 20.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1304 | |
| 0 | 1304 | |
| 3 | 706 | |
| 1 | 326 | 8.3% |
| 2 | 272 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3912 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 1304 | |
| 0 | 1304 | |
| 3 | 706 | |
| 1 | 326 | 8.3% |
| 2 | 272 | 7.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3912 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 1304 | |
| 0 | 1304 | |
| 3 | 706 | |
| 1 | 326 | 8.3% |
| 2 | 272 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3912 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 1304 | |
| 0 | 1304 | |
| 3 | 706 | |
| 1 | 326 | 8.3% |
| 2 | 272 | 7.0% |
Interactions
Correlations
| Age | Age_wiki | Boarded | Class | Embarked | Fare | Lifeboat | Parch | PassengerId | Pclass | Sex | SibSp | Survived | WikiId | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.980 | 0.073 | 0.294 | 0.063 | 0.193 | 0.169 | -0.216 | 0.027 | 0.297 | 0.091 | -0.130 | 0.155 | -0.350 |
| Age_wiki | 0.980 | 1.000 | 0.104 | 0.323 | 0.119 | 0.209 | 0.151 | -0.209 | 0.022 | 0.325 | 0.072 | -0.145 | 0.132 | -0.353 |
| Boarded | 0.073 | 0.104 | 1.000 | 0.290 | 0.935 | 0.176 | 0.455 | 0.063 | 0.000 | 0.288 | 0.140 | 0.101 | 0.180 | 0.249 |
| Class | 0.294 | 0.323 | 0.290 | 1.000 | 0.279 | 0.475 | 0.707 | 0.036 | 0.044 | 0.992 | 0.118 | 0.153 | 0.340 | 0.899 |
| Embarked | 0.063 | 0.119 | 0.935 | 0.279 | 1.000 | 0.218 | 0.450 | 0.093 | 0.033 | 0.278 | 0.116 | 0.114 | 0.166 | 0.294 |
| Fare | 0.193 | 0.209 | 0.176 | 0.475 | 0.218 | 1.000 | 0.356 | 0.400 | -0.004 | 0.482 | 0.185 | 0.446 | 0.283 | -0.637 |
| Lifeboat | 0.169 | 0.151 | 0.455 | 0.707 | 0.450 | 0.356 | 1.000 | 0.105 | 0.035 | 0.706 | 0.418 | 0.176 | 0.349 | 0.308 |
| Parch | -0.216 | -0.209 | 0.063 | 0.036 | 0.093 | 0.400 | 0.105 | 1.000 | -0.006 | 0.036 | 0.236 | 0.438 | 0.157 | -0.042 |
| PassengerId | 0.027 | 0.022 | 0.000 | 0.044 | 0.033 | -0.004 | 0.035 | -0.006 | 1.000 | 0.034 | 0.000 | -0.032 | 0.035 | -0.044 |
| Pclass | 0.297 | 0.325 | 0.288 | 0.992 | 0.278 | 0.482 | 0.706 | 0.036 | 0.034 | 1.000 | 0.119 | 0.153 | 0.337 | 0.891 |
| Sex | 0.091 | 0.072 | 0.140 | 0.118 | 0.116 | 0.185 | 0.418 | 0.236 | 0.000 | 0.119 | 1.000 | 0.187 | 0.540 | 0.133 |
| SibSp | -0.130 | -0.145 | 0.101 | 0.153 | 0.114 | 0.446 | 0.176 | 0.438 | -0.032 | 0.153 | 0.187 | 1.000 | 0.187 | -0.076 |
| Survived | 0.155 | 0.132 | 0.180 | 0.340 | 0.166 | 0.283 | 0.349 | 0.157 | 0.035 | 0.337 | 0.540 | 0.187 | 1.000 | 0.345 |
| WikiId | -0.350 | -0.353 | 0.249 | 0.899 | 0.294 | -0.637 | 0.308 | -0.042 | -0.044 | 0.891 | 0.133 | -0.076 | 0.345 | 1.000 |
Missing values
Sample
| PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | WikiId | Name_wiki | Age_wiki | Hometown | Boarded | Destination | Lifeboat | Body | Class | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 0.0 | 3 | Braund, Mr. Owen Harris | male | 22.0 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S | 691.0 | Braund, Mr. Owen Harris | 22.0 | Bridgerule, Devon, England | Southampton | Qu'Appelle Valley, Saskatchewan, Canada | NaN | NaN | 3.0 |
| 1 | 2 | 1.0 | 1 | Cumings, Mrs. John Bradley (Florence Briggs Thayer) | female | 38.0 | 1 | 0 | PC 17599 | 71.2833 | C85 | C | 90.0 | Cumings, Mrs. Florence Briggs (née Thayer) | 35.0 | New York, New York, US | Cherbourg | New York, New York, US | 4 | NaN | 1.0 |
| 2 | 3 | 1.0 | 3 | Heikkinen, Miss. Laina | female | 26.0 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S | 865.0 | Heikkinen, Miss Laina | 26.0 | Jyväskylä, Finland | Southampton | New York City | 14? | NaN | 3.0 |
| 3 | 4 | 1.0 | 1 | Futrelle, Mrs. Jacques Heath (Lily May Peel) | female | 35.0 | 1 | 0 | 113803 | 53.1000 | C123 | S | 127.0 | Futrelle, Mrs. Lily May (née Peel) | 35.0 | Scituate, Massachusetts, US | Southampton | Scituate, Massachusetts, US | D | NaN | 1.0 |
| 4 | 5 | 0.0 | 3 | Allen, Mr. William Henry | male | 35.0 | 0 | 0 | 373450 | 8.0500 | NaN | S | 627.0 | Allen, Mr. William Henry | 35.0 | Birmingham, West Midlands, England | Southampton | New York City | NaN | NaN | 3.0 |
| 5 | 6 | 0.0 | 3 | Moran, Mr. James | male | NaN | 0 | 0 | 330877 | 8.4583 | NaN | Q | 785.0 | Doherty, Mr. William John (aka "James Moran") | 22.0 | Cork, Ireland | Queenstown | New York City | NaN | NaN | 3.0 |
| 6 | 7 | 0.0 | 1 | McCarthy, Mr. Timothy J | male | 54.0 | 0 | 0 | 17463 | 51.8625 | E46 | S | 200.0 | McCarthy, Mr. Timothy J. | 54.0 | Dorchester, Massachusetts, US | Southampton | Dorchester, Massachusetts, US | NaN | 175MB | 1.0 |
| 7 | 8 | 0.0 | 3 | Palsson, Master. Gosta Leonard | male | 2.0 | 3 | 1 | 349909 | 21.0750 | NaN | S | 1108.0 | Pålsson, Master Gösta Leonard | 2.0 | Bjuv, Skåne, Sweden | Southampton | Chicago, Illinois, US | NaN | NaN | 3.0 |
| 8 | 9 | 1.0 | 3 | Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg) | female | 27.0 | 0 | 2 | 347742 | 11.1333 | NaN | S | 902.0 | Johnson, Mrs. Elisabeth Vilhelmina (née Berg) | 26.0 | St. Charles, Illinois, US | Southampton | St. Charles, Illinois, US | 15 | NaN | 3.0 |
| 9 | 10 | 1.0 | 2 | Nasser, Mrs. Nicholas (Adele Achem) | female | 14.0 | 1 | 0 | 237736 | 30.0708 | NaN | C | 520.0 | Nassr Allah, Mrs. Adal (née Akim)[62][77] | 14.0 | Zahlé, Lebanon, Ottoman Empire | Cherbourg | Cleveland, Ohio, US | ? | NaN | 2.0 |
| PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | WikiId | Name_wiki | Age_wiki | Hometown | Boarded | Destination | Lifeboat | Body | Class | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1299 | 1300 | NaN | 3 | Riordan, Miss. Johanna Hannah"" | female | NaN | 0 | 0 | 334915 | 7.7208 | NaN | Q | 1154.0 | Riordan, Miss Hannah | 18.0 | Glenlougha, Cork, Ireland | Queenstown | New York City | 13 | NaN | 3.0 |
| 1300 | 1301 | NaN | 3 | Peacock, Miss. Treasteall | female | 3.0 | 1 | 1 | SOTON/O.Q. 3101315 | 13.7750 | NaN | S | 1119.0 | Peacock, Miss Treasteall | 4.0 | Southampton, Hampshire, England | Southampton | Elizabeth, New Jersey, US | NaN | NaN | 3.0 |
| 1301 | 1302 | NaN | 3 | Naughton, Miss. Hannah | female | NaN | 0 | 0 | 365237 | 7.7500 | NaN | Q | 1064.0 | Naughton, Miss Hannah | 21.0 | Donoughmore, Ireland | Queenstown | New York City | NaN | NaN | 3.0 |
| 1302 | 1303 | NaN | 1 | Minahan, Mrs. William Edward (Lillian E Thorpe) | female | 37.0 | 1 | 0 | 19928 | 90.0000 | C78 | Q | 206.0 | Minahan, Mrs. Lillian E. (née Thorpe) | 37.0 | Fond du Lac, Wisconsin, US | Southampton | Fond du Lac, Wisconsin, US | 14 | NaN | 1.0 |
| 1303 | 1304 | NaN | 3 | Henriksson, Miss. Jenny Lovisa | female | 28.0 | 0 | 0 | 347086 | 7.7750 | NaN | S | 869.0 | Henriksson, Miss Jenny Lovisa | 28.0 | Stockholm, Sweden | Southampton | Iron Mountain, Michigan, US | NaN | 3MB | 3.0 |
| 1304 | 1305 | NaN | 3 | Spector, Mr. Woolf | male | NaN | 0 | 0 | A.5. 3236 | 8.0500 | NaN | S | 1227.0 | Spector, Mr. Woolf | 23.0 | London, England | Southampton | New York City | NaN | NaN | 3.0 |
| 1305 | 1306 | NaN | 1 | Oliva y Ocana, Dona. Fermina | female | 39.0 | 0 | 0 | PC 17758 | 108.9000 | C105 | C | 229.0 | and maid, Doña Fermina Oliva y Ocana | 39.0 | Madrid, Spain | Cherbourg | New York, New York, US | 8 | NaN | 1.0 |
| 1306 | 1307 | NaN | 3 | Saether, Mr. Simon Sivertsen | male | 38.5 | 0 | 0 | SOTON/O.Q. 3101262 | 7.2500 | NaN | S | 1169.0 | Sæther, Mr. Simon Sivertsen | 43.0 | Skaun, Sør-Trøndelag, Norway | Southampton | US | NaN | 32MB | 3.0 |
| 1307 | 1308 | NaN | 3 | Ware, Mr. Frederick | male | NaN | 0 | 0 | 359309 | 8.0500 | NaN | S | 1289.0 | Ware, Mr. Frederick William | 34.0 | Greenwich, London, England | Southampton | New York City | NaN | NaN | 3.0 |
| 1308 | 1309 | NaN | 3 | Peter, Master. Michael J | male | NaN | 1 | 1 | 2668 | 22.3583 | NaN | C | 702.0 | Butrus-Youssef, Master Makhkhul | 4.0 | Sar'al[81], Syria | Cherbourg | Detroit, Michigan, US | D | NaN | 3.0 |